Title of Thesis: Fast Modular Exponentiation Using Residue Domain Representation: a Hardware Implementation and Analysis Title of Thesis: Fast Modular Exponentiation Using Residue Domain Repre- Sentation: a Hardware Implementation and Analysis

نویسندگان

  • Christopher Dinh Nguyen
  • Alan T. Sherman
  • Dhananjay S. Phatak
چکیده

Title of Thesis: Fast Modular Exponentiation Using Residue Domain Representation: A Hardware Implementation and Analysis Christopher Dinh Nguyen, Master of Science, 2013 Thesis directed by: Alan T. Sherman, Associate Professor Department of Computer Science and Electrical Engineering Dhananjay S. Phatak, Associate Professor Department of Computer Science and Electrical Engineering Using modular exponentiation as an application, we engineered on FPGA fabric and analyzed the first implementation of two arithmetic algorithms in Reduced-Precision Residue Number Systems (RP-RNS): the partial-reconstruction algorithm and quotient-first scaling algorithm. Residue number systems (RNS) provide an alternative representation to the binary system for computation. They offer full parallel computation for addition, subtraction, and multiplication. However, base extension, division, and sign detection become harder operations. Phatak’s RP-RNS uses a time-memory trade-off to achieve O (lgN) running time for base extension and scaling, where N is the bit-length of the operands, compared with Kawamura’s Cox-Rower architecture and its derivatives, which appear to take O (N) steps and therefore O (N) delay to the best of our knowledge. We implemented the fully parallel RP-RNS architecture based on Phatak’s description and architecture diagrams. Our design decisions included distributing the lookup tables among each channel, removing the adder trees, and removing the parallel table access thus trading size for speed. In retrospect, we should have hosted the tables in memory off the FPGA. We measured the FPGA utilization, storage size, and cycle counts. The data we present, though less than optimal, confirms the theoretical trends calculated by Phatak. FPGA utilization grows proportional K logK where K is the number of hardware channels. Storage grows proportional to O (N lg lgN). When using Phatak’s recommendations, cycle count grows proportional to O (lgN). Our contributions include documentation of our design, architecture, and implementation; a detailed testing methodology; and performance data based on our implementation to enable others to replicate our implementation and findings. Fast Modular Exponentiation Using Residue Domain Representation: A Hardware Implementation and Analysis by Christopher Dinh Nguyen Thesis submitted to the Faculty of the Graduate School of the University of Maryland, Baltimore County in partial fulfillment of the requirements for the degree of Master of Science 2013

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of Rns Montgomery Exponentiation Algorithm for Cryptography

The hardware implementation of modular exponentiation for very large integers is a well-known topic in digital arithmetic. An effective approach for obtaining parallel and carry-free implementations consists in using the Montgomery exponentiation algorithm and executing the necessary operations in RNS. Two efficient methods for performing the RNS Montgomery exponentiation have been proposed by ...

متن کامل

An Enhanced Residue Modular Multiplier for Cryptography

This paper presents an implementation of VLSI architecture for Dual Field Residue Arithmetic modular multiplier with less delay based on finite field arithmetic to support all public key cryptographic applications. A new method for incorporating Residue Number System (RNS) and Polynomial Residue Number system (PRNS) in modular multiplication is derived and then implemented VLSI Architecture for...

متن کامل

Public Key Cryptography on Modern Graphics Hardware

Graphics processing units (GPUs) are increasingly being used for general purpose processing. We present implementations of large integer modular exponentiation, the core of public-key cryptosystems such as RSA, on a DirectX 10 compliant GPU. DirectX 10 compliant graphics processors are the latest generation of GPU architecture, which provide increased programming flexibility and support for int...

متن کامل

Efficient Acceleration of Asymmetric Cryptography on Graphics Hardware

Graphics processing units (GPU) are increasingly being used for general purpose computing. We present implementations of large integer modular exponentiation, the core of public-key cryptosystems such as RSA, on a DirectX 10 compliant GPU. DirectX 10 compliant graphics processors are the latest generation of GPU architecture, which provide increased programming flexibility and support for integ...

متن کامل

An Abstract of the Thesis of High-speed Algorithms & Architectures for Number-theoretic Cryptosystems

approved: C etin K. Ko c Computer and network security systems rely on the privacy and authenticity of information, which requires implementation of cryptographic functions. Software implementations of these functions are often desired because of their exibility and cost e ectiveness. In this study, we concentrate on developing high-speed and area-e cient modular multiplication and exponentiati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013